CDS

Accession Number TCMCG075C01343
gbkey CDS
Protein Id XP_017977769.1
Location join(6820382..6820657,6821395..6822192,6823711..6823887,6824159..6824250,6824905..6825003,6825201..6825676,6825773..6825816,6825902..6826168,6826565..6826712,6827421..6827537,6828085..6828185,6828333..6828453,6828648..6828760,6829079..6829190,6829376..6829480,6829570..6829636,6829743..6829863,6829974..6830147,6830532..6830649,6830732..6831273)
Gene LOC18611791
GeneID 18611791
Organism Theobroma cacao
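
The Location field above lists the exon segments that are spliced together to form the coding sequence. As a minimal sketch (the helper names `parse_join` and `spliced_length` are hypothetical, and the code assumes a simple forward-strand `join(...)` with no `complement(...)` wrapper or partial-end markers), the segments can be parsed and their combined length compared with the protein length reported in this record:

```python
import re

# Location string copied verbatim from the record's "Location" field.
LOCATION = (
    "join(6820382..6820657,6821395..6822192,6823711..6823887,"
    "6824159..6824250,6824905..6825003,6825201..6825676,"
    "6825773..6825816,6825902..6826168,6826565..6826712,"
    "6827421..6827537,6828085..6828185,6828333..6828453,"
    "6828648..6828760,6829079..6829190,6829376..6829480,"
    "6829570..6829636,6829743..6829863,6829974..6830147,"
    "6830532..6830649,6830732..6831273)"
)

def parse_join(location):
    """Return a list of (start, end) integer pairs from a join(...) string."""
    return [(int(s), int(e)) for s, e in re.findall(r"(\d+)\.\.(\d+)", location)]

def spliced_length(segments):
    """Total length in nucleotides; coordinates are 1-based and inclusive."""
    return sum(end - start + 1 for start, end in segments)

segments = parse_join(LOCATION)
cds_length = spliced_length(segments)

# 20 segments totalling 4068 nt, i.e. 1355 codons plus one stop codon.
print(len(segments), cds_length)
```

The total of 4068 nucleotides equals 3 × (1355 + 1), which is consistent with the 1355 aa protein length plus a terminal stop codon.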

Protein

Length 1355aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA341501
db_source XM_018122280.1
Definition PREDICTED: protein TONSOKU isoform X1 [Theobroma cacao]

EGGNOG-MAPPER Annotation

COG_category L
Description bru1,mgo3,tsk
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE -
KEGG_ko -
EC -
KEGG_Pathway -
GOs GO:0000086
GO:0000278
GO:0005575
GO:0005622
GO:0005623
GO:0005634
GO:0007049
GO:0008150
GO:0009653
GO:0009888
GO:0009933
GO:0009934
GO:0009987
GO:0016020
GO:0022402
GO:0022603
GO:0032502
GO:0043226
GO:0043227
GO:0043229
GO:0043231
GO:0044424
GO:0044464
GO:0044770
GO:0044772
GO:0044839
GO:0048507
GO:0048509
GO:0048532
GO:0048856
GO:0050789
GO:0050793
GO:0051301
GO:0065007
GO:1903047

Sequence

CDS:  
ATGGCTCGCGAAGAGGAATTGCAGATAAACATGGCGAAGCGTGCGTACAGGAAAGCCAAAGAGGAAGGGAATAGGCAAGAAGAGGCAAGGTGGGCAAATGTAATTGGAGATATTCTGAAGAACAGAGGAGAGTACGTGGAGGCACTCAAGTGGTTTCGAATCGATTACGATATTTCGAACAAGTATTTGCCTGAGAAACAGTTGCTTCCCACTTGTCAGGCTCTAGGTGAGGTCTATCTTAGGTTGGAAGACTACCAGGATGCGTTGATTTATCAGAAAAAACATTTGGATCTTGCTAAGGATGCCAATGATCTTGTTGAGCAGCAAAGAGCCAGCACACAACTTGGGCGTACCTACCATGAAATGTTTTTAAAATCTGATGGTGACCATTACTCAGTTCGGAAAGCCAAAAAATATTTTAAGTGTGCCATGGAGCTTGCTCAGACTCTCAAGGAGAACCCCCCTAATAATAAGTCTTCTTTTCGCAAAGAGTATATTGATGCACACAATAATATTGGTATGCTAGAAATGGACCTTGATAACTTGGATGAAGCACTAAAATTCCTCACTAAAGGTTTGGAGATATGTGATGAAGAGGAAGTCGCTGAGGATGATGATGGGCGTAGTAGGCTTCATCACAATCTGGGAAATGTTTATATTGAGTTGAGGAGGTGGGACAAAGCACGTGAACACATTGAGAAAGACATAATAATTTGTAAGAGAATTGGGCACTGCCAAGGAGAGGCAAAAGGGTACATTAATCTTGGTGAGTTGCATTACAGGGTTCAAAAGTATGATGAGGCAATACTTTGCTACCAGAAGGCACTTGATTTGGCAAAATCCATGGAAGATGAGGATGCTTTAGCAGCACAAATCAATCAAAATATCAATACTGTAAAGGAAGCTATTAATGTAATGAATGAACTGAAAAAGGAAGAGCAGAATCTCAAGAAGCTTAGAAGAAAGATGGTTACTGCAAAAGGAACACCGGAGGAGCGTAAGTTTCTGCTACAGCAGAATTCATGTCTAGATCGTCTCATTGAGAAATCAGCTATGATCTTTGCATGGCTGAAGCATCGTGAATTTGCCAAAAGGAAGAAGAGCATAGCAAGTGAACTTTGTGATAAGGAAAAGCTGAGTGATGCATTTTTAGTTGTTGGAGAATCATACCAGAAGCTCAGGAACTTCAGCAAAGCCATTAAATGGTACACAAAGAGTTGGGAAGGATACAAGTTAATTGGAAATCTGGAGGGTCAAGCATTGGCAAAAATCAATATTGGTGATGTTTTGGACTGCAGTGGTGATTGGACTGGAGCACTTGAGGCATTTGAAGAGGGCTGCAGGATTGCTGCTAAAGCTAAGCTTCCTTCTGTTCAGATTTCTGCATTGGAGAATATGCATTATAGCCACATGATAAGATTTGACCATGTTGAAGAGGCCAGGAGGTTGCAGCTTGAAATTGACAAATTGAAACAGTCAAAAACTAAGGAGCTTGAAGCAAAACATGTCACAATGGATTGCTGCTCTGAAACTGACACAGAAGGGGATGATCATTGCTCAGATAACATGTCTAATGCATACTCAGGGGTGATGAGTAAATCCAATTCTAACAAATCAGCATCTTTAGCTGCTAGTGGAGAGCTGAATGATGATTTACCTCTAATTTCACTTATCCGACCCAGCAAAAGATCATCCAAAACTGAAACAGCCCCCTCAGGAAAATACAATATTTCTACTGAGCCAGATGAAGCTTTTCCAAGTAGTTTGTCCAAATCAACAAGAAATCAGCAAACAGTTGTTGGTCGTAAACGTGTTCGTGTAGTCCTCTCGGATGATGAAGGTGACATGCATGATGAAGTAGAAGGCTCTGCAGGAAGGCTGCATGAATGTCCAGTTAATGTTGCTGCTTCAAACGAATTTAAGAGTAAAAGTGGTCCTGTCCGTTCTGATTATAAATCTCAGGATTTCTCCCCAGTTCCCTCCAAAAGTCCCATCCAGCA
CTGCAATCTTGTCAATATTGAAGAAAGCATTTGTTCATACAAATCTGTGAGCCCCAATATAACTGTTTCGAATGGAAAAATTAACAGATCTTTAAGTGATGCCGAAGTTGTTATTGTTTCTGGTTTTGCTGCTAGCACTTTGAAATGTGACATCAATGCCTCTGAGAACTTACTGCACGAACATAATGCTCCTCATCTCAAGTTGCATACTACTGATAATGAGGATGATGGCTGCATAACATTCAAAGTTGATACCAGCATGATAAATGTGGAATTGAGCTCATTTATGATTGGCGATAAGATAAGCATGGAGTCTTTGGAGGTTGAACTAGCTTGCTTATATTATTTGCAGCTTCCTGTAGAGAAGAGATCAAAGGGTCTGTTGCCCATCATTCAGAACATGGAATGTGGTGGGAGACCTCTGGAATCCTTTGAAAACTTTGATGCTCTTAGGAATCATATGAGGAAGGTTTTGGTTGATGTTGTTATTGATGGATGGGTCCAGAAGCGCTTGATGAAATTGTATATTGACAGCTGCAATGAATTATCTGAGGCACCAAATATGAAGTTGCTTAAAAAATTGTATGTTTCAGAGATAGAGGATGACGTTGATGTGTCTGAATGTGAACTGCAAGATATATCAGTAACTCCATTGTTGAATGCCTTGTATACACATAAAGGGGTTGCCATGCTAGACCTTTCTCACAACTTGTTAGGAAATGCAACAATGGAGAAACTCCAACAATTTTTTTCATCATCAGGCCAAAAATATGCTGACTTGACTTTGGATTTGCATTGCAATCGATTTGGTCCTACTGCTTTATTCCAGATTTGTGAGTGCCCTGTGCTGTTCACTCGACTTGAAGTACTTAATATCTCTGGTAATCGTCTCACTGATGCATGTGGATCATATTTATCAACCATCCTTGAAAAGTGCAAAGCCCTGTTCAGCTTGAATGTAGAGCGCTGTTCCATTACTTCCAGAACAATTCAAAAGGTTGCTGATGCACTTGGTACTGGATCTGCCCTATCACAACTTCTCATAGGATATAACAACCCTATATCTGGGAATGCCATCACTAATTTATTGGGCAAACTTGCTAAAATGAAAAGATTTTCAGATCTTAGTTTAAATGGCTTAAAACTAAGCAAGCCTGTGGTTGATGGCCTTTGCTACCTTGCTAAAACCTCATGTCTGTCACGATTGATGCTTGAGGGCACTGGTATAGGAACTGATGGAGCACTAGGATTGACTCAATCATTGTTCAGCTCAACTCAGGAGCCTTTGAAACTTGATCTGTCTTATTGTGGAGTAACATCTACATACGTTTATCAACTTAACACCGATGTTACTTTCATTAGTGGCATTCTTGAGCTCAACCTTGGAGGAAATCCCATTATGCTAGAGGGTGGCAATGCATTAGCATCACTGCTTATCAATCCTCAATGTTGTCTAAAAGCTTTGATCCTGAACAAGTGTCAGCTGGGGATGGCTGGAATTCTTCAAATAATTCAGGCACTAGCAGAGAATGACTCTCTAGAAGAGCTCAATCTTGCTGATAATGCTGATACGAATAAGCAGTTAACTATACAATGTGATAAATTGACAAAAGAGAGCTCAGAGTACTTACAGCCAGACCACACCATATCTGAACCGTATCTTAATCAATGTGTTTCCAAGGAATGTGATGTTGAGCAGGGCATGTGTGTTATTAACGCAGACTGCTGTAAGCTTGAAGTTGCTGATAGCGAAGATGATGAGGTAAGGGTAGGAACAGCTGCATGTGAGTTCGATGACAGCTGTGCAAGCTCATGCCAAAGAAATTCGTCTATGGAGTGCCAGTTCATTCAAGATCTTTCCACTGCCATTGGCATGGTAAAGCAGTTGCAGGTGTTAGATCTTAGCAATAATGGGTTCTCAGTTGAAGCTTCCGAAGCTTTATTCAATGCTTGGTCATCAGGTTCAAGGGTTGGTTTAGCTTGGCGGCACATTGATAATC
AGACAATCCATTTGTCAGTGGAAGTAAATAAGTGTTGTAGAGTCAAATCCTGCTGCAAAAAGGATTAA
Protein:  
MAREEELQINMAKRAYRKAKEEGNRQEEARWANVIGDILKNRGEYVEALKWFRIDYDISNKYLPEKQLLPTCQALGEVYLRLEDYQDALIYQKKHLDLAKDANDLVEQQRASTQLGRTYHEMFLKSDGDHYSVRKAKKYFKCAMELAQTLKENPPNNKSSFRKEYIDAHNNIGMLEMDLDNLDEALKFLTKGLEICDEEEVAEDDDGRSRLHHNLGNVYIELRRWDKAREHIEKDIIICKRIGHCQGEAKGYINLGELHYRVQKYDEAILCYQKALDLAKSMEDEDALAAQINQNINTVKEAINVMNELKKEEQNLKKLRRKMVTAKGTPEERKFLLQQNSCLDRLIEKSAMIFAWLKHREFAKRKKSIASELCDKEKLSDAFLVVGESYQKLRNFSKAIKWYTKSWEGYKLIGNLEGQALAKINIGDVLDCSGDWTGALEAFEEGCRIAAKAKLPSVQISALENMHYSHMIRFDHVEEARRLQLEIDKLKQSKTKELEAKHVTMDCCSETDTEGDDHCSDNMSNAYSGVMSKSNSNKSASLAASGELNDDLPLISLIRPSKRSSKTETAPSGKYNISTEPDEAFPSSLSKSTRNQQTVVGRKRVRVVLSDDEGDMHDEVEGSAGRLHECPVNVAASNEFKSKSGPVRSDYKSQDFSPVPSKSPIQHCNLVNIEESICSYKSVSPNITVSNGKINRSLSDAEVVIVSGFAASTLKCDINASENLLHEHNAPHLKLHTTDNEDDGCITFKVDTSMINVELSSFMIGDKISMESLEVELACLYYLQLPVEKRSKGLLPIIQNMECGGRPLESFENFDALRNHMRKVLVDVVIDGWVQKRLMKLYIDSCNELSEAPNMKLLKKLYVSEIEDDVDVSECELQDISVTPLLNALYTHKGVAMLDLSHNLLGNATMEKLQQFFSSSGQKYADLTLDLHCNRFGPTALFQICECPVLFTRLEVLNISGNRLTDACGSYLSTILEKCKALFSLNVERCSITSRTIQKVADALGTGSALSQLLIGYNNPISGNAITNLLGKLAKMKRFSDLSLNGLKLSKPVVDGLCYLAKTSCLSRLMLEGTGIGTDGALGLTQSLFSSTQEPLKLDLSYCGVTSTYVYQLNTDVTFISGILELNLGGNPIMLEGGNALASLLINPQCCLKALILNKCQLGMAGILQIIQALAENDSLEELNLADNADTNKQLTIQCDKLTKESSEYLQPDHTISEPYLNQCVSKECDVEQGMCVINADCCKLEVADSEDDEVRVGTAACEFDDSCASSCQRNSSMECQFIQDLSTAIGMVKQLQVLDLSNNGFSVEASEALFNAWSSGSRVGLAWRHIDNQTIHLSVEVNKCCRVKSCCKKD
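
The relationship between the CDS and Protein sequences above can be illustrated with a short translation sketch. It assumes the standard genetic code (NCBI translation table 1) and an in-frame sequence starting at the first base; the function name `translate` is hypothetical, and the two fragments are the first 24 and last 18 nucleotides of the CDS shown in this record.

```python
from itertools import product

# Standard genetic code, with codons enumerated in T, C, A, G order.
BASES = "TCAG"
AMINO = (
    "FFLLSSSSYY**CC*W"
    "LLLLPPPPHHQQRRRR"
    "IIIMTTTTNNKKSSRR"
    "VVVVAAAADDEEGGGG"
)
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO)
}

def translate(cds):
    """Translate an in-frame DNA sequence, stopping at the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE[cds[i:i + 3].upper()]
        if aa == "*":  # stop codon: end of the protein
            break
        protein.append(aa)
    return "".join(protein)

# First 24 nt and last 18 nt of the CDS in this record.
cds_start = "ATGGCTCGCGAAGAGGAATTGCAG"
cds_end = "TGCTGCAAAAAGGATTAA"

print(translate(cds_start))  # MAREEELQ
print(translate(cds_end))    # CCKKD
```

Both outputs match the corresponding ends of the Protein sequence, which begins with MAREEELQ and ends with CCKKD.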